A New Language Model For Automatic Arabic Speech Recognit…

Author

  • Mohsen Rashwan
Abstract

A new language model for large-vocabulary automatic speech recognition (ASR) of Arabic is introduced. The derivative feature of the Arabic word is useful for dividing the recognition process into two phases. In phase 1, the fixed words, the prefix, the suffix and the form of the derivative words are determined through a phase-1 M-gram, given the acoustical data. In phase 2, another M-gram is used to determine the roots of the derivative words. The idea was tested on 60 words (10 roots x 6 forms). The results are encouraging, and more work is to follow to realize a complete large-vocabulary ASR system for Arabic.
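The two-phase idea can be illustrated with a small sketch. This is not the author's implementation: it assumes M = 2 (bigrams), ignores fixed words, prefixes and suffixes, and uses made-up acoustic log scores, transliterated form names (`fa3al`, `faa3il`) and roots (`ktb`, `drs`). All function names, tables and numbers below are hypothetical. Phase 1 picks the most likely sequence of forms, and phase 2 then picks the roots given those forms.

```python
import math

def viterbi(candidates, emit, trans):
    """Return the best label sequence under emission + bigram log scores."""
    # best[label] = (score, path) for the best path ending in that label
    best = {c: (emit(0, c), [c]) for c in candidates[0]}
    for t in range(1, len(candidates)):
        nxt = {}
        for c in candidates[t]:
            score, path = max(
                ((s + trans(p, c), pth) for p, (s, pth) in best.items()),
                key=lambda x: x[0],
            )
            nxt[c] = (score + emit(t, c), path + [c])
        best = nxt
    return max(best.values(), key=lambda x: x[0])[1]

def bigram(table, floor=1e-3):
    """Turn a {(prev, cur): prob} table into a log-probability function."""
    return lambda prev, cur: math.log(table.get((prev, cur), floor))

def two_phase_decode(form_scores, root_scores, form_lm, root_lm):
    # Phase 1: choose the form of each word (acoustic score + form bigram).
    forms = viterbi(
        [list(d) for d in form_scores],
        lambda t, f: form_scores[t][f],
        form_lm,
    )
    # Phase 2: choose the roots, restricted to the forms fixed in phase 1.
    roots = viterbi(
        [[r for (f, r) in d if f == forms[t]] for t, d in enumerate(root_scores)],
        lambda t, r: root_scores[t][(forms[t], r)],
        root_lm,
    )
    return list(zip(roots, forms))

# Hypothetical acoustic log scores for a two-word utterance.
form_scores = [
    {"fa3al": -1.0, "faa3il": -1.2},
    {"fa3al": -2.0, "faa3il": -0.5},
]
root_scores = [
    {("fa3al", "ktb"): -0.7, ("fa3al", "drs"): -0.9,
     ("faa3il", "ktb"): -1.0, ("faa3il", "drs"): -1.0},
    {("fa3al", "ktb"): -1.5, ("fa3al", "drs"): -1.5,
     ("faa3il", "ktb"): -0.8, ("faa3il", "drs"): -0.6},
]
# Hypothetical bigram probabilities over forms and over roots.
form_lm = bigram({("fa3al", "fa3al"): 0.4, ("fa3al", "faa3il"): 0.6,
                  ("faa3il", "fa3al"): 0.5, ("faa3il", "faa3il"): 0.5})
root_lm = bigram({("ktb", "ktb"): 0.7, ("ktb", "drs"): 0.3,
                  ("drs", "ktb"): 0.4, ("drs", "drs"): 0.6})

result = two_phase_decode(form_scores, root_scores, form_lm, root_lm)
print(result)
```

Splitting the search this way keeps each language model small: with the 10 roots and 6 forms mentioned in the abstract, the two models cover 6 and 10 labels respectively instead of one model over all 60 root-form combinations.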


Similar resources

Automatic Construction of Persian ICT WordNet using Princeton WordNet

WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is a considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

Completion of Japanese sentences by inferring function words from content words

A method of generating a Japanese sentence by inferring function words from content words using valency patterns is presented. A procedure for selecting an appropriate function word, on the assumption that correct content words have been selected for a given phrase lattice, is described. A method of inferring a correct verb when verbs are recognized less accurately than nouns by the speech ...

Speech Recognition System of Arabic Digits based on A Telephony Arabic Corpus

Automatic recognition of spoken digits is one of the difficult tasks in the field of computer speech recognition. Spoken digits recognition process is required in many applications such as speech based telephone dialing, airline reservation, automatic directory to retrieve or send information, etc. These applications take numbers and alphabets as input. Arabic language is a Semitic language tha...

Arabic speaker-independent continuous automatic speech recognition based on a phonetically rich and balanced speech corpus

This paper describes and proposes an efficient and effective framework for the design and development of a speaker-independent continuous automatic Arabic speech recognition system based on a phonetically rich and balanced speech corpus. The speech corpus contains a total of 415 sentences recorded by 40 (20 male and 20 female) Arabic native speakers from 11 different Arab countries representing...


Publication date: 2004